On Anomaly Ranking and Excess-Mass Curves, Supplementary Material

نویسندگان

  • Nicolas Goix
  • Anne Sabourin
  • Stéphan Clémençon
چکیده

Let t > 0. Recall that EM∗(t) = α(t) − tλ(t) where α(t) denote the mass at level t, namely α(t) = P(f(X) ≥ t), and λ(t) denote the volume at level t, i.e. λ(t) = Leb({x, f(x) ≥ t}). For h > 0, let A(h) denote the quantity A(h) = 1/h(α(t + h) − α(t)) and B(h) = 1/h(λ(t + h) − λ(t)). It is straightforward to see that A(h) and B(h) converge when h→ 0, and expressing EM∗ ′ = α′(t)−tλ′(t)−λ(t), it suffices to show that α′(t)−tλ′(t) = 0, namely limh→0A(h)−t B(h) = 0. Now we have A(h) − t B(h) = 1 h ∫ t≤f≤t+h f − t ≤ 1 h ∫ t≤f≤t+h h = Leb(t ≤ f ≤ t+h)→ 0 because f has no flat part.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Anomaly Ranking and Excess-Mass Curves

Learning how to rank multivariate unlabeled observations depending on their degree of abnormality/novelty is a crucial problem in a wide range of applications. In practice, it generally consists in building a real valued ”scoring” function on the feature space so as to quantify to which extent observations should be considered as abnormal. In the 1d situation, measurements are generally conside...

متن کامل

How to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms?

When sufficient labeled data are available, classical criteria based on Receiver Operating Characteristic (ROC) or Precision-Recall (PR) curves can be used to compare the performance of unsupervised anomaly detection algorithms. However, in many situations, few or no data are labeled. This calls for alternative criteria one can compute on non-labeled data. In this paper, two criteria that do no...

متن کامل

Title : Adaptive Anomaly Detection using Isolation Forest

Ranking measure is of prime importance in anomaly detection tasks because it is required to rank the instances from the most anomalous to the most normal. This paper investigates the underlying assumptions and definitions used for ranking in existing anomaly detection methods; and it has three aims: First, we show evidence that the two commonly used ranking measures—distance and density—cannot ...

متن کامل

Supplementary Material: Entropy Measures for Stochastic Processes with Applications in Functional Anomaly Detection

Supplementary Material: Entropy Measures for Stochastic Processes with Applications in Functional Anomaly Detection Gabriel Martos 1, ̊, Nicolás Hernández 2, Alberto Muñoz 2 and Javier M. Moguerza 3 1 Universidad de Buenos Aires and CONICET; [email protected] 2 Universidad Carlos III de Madrid; {nihernan,albmun}@est-econ.uc3m.es 3 Universidad Rey Juan Carlos; [email protected] ...

متن کامل

Anomaly Ranking as Supervised Bipartite Ranking

The Mass Volume (MV) curve is a visual tool to evaluate the performance of a scoring function with regard to its capacity to rank data in the same order as the underlying density function. Anomaly ranking refers to the unsupervised learning task which consists in building a scoring function, based on unlabeled data, with a MV curve as low as possible at any point. In this paper, it is proved th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015